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Abstract 

The information carrying capacity of the d-dimensional depolarizing 
channel is computed. It is shown that this capacity can be achieved by 
encoding messages as products of pure states belonging to an orthonormal 
basis of the state space, and using measurements which are products of 
projections onto this same orthonormal basis. In other words, neither 
entangled signal states nor entangled measurements give any advantage 
for information capacity. The result follows from an additivity theorem 
for the product channel A where A is the depolarizing channel and 
^' is a completely arbitrary channel. We establish the Amosov-Holevo- 
Werner p-norm conjecture for this product channel for all p > 1, and 
deduce from this the additivity of the minimal entropy and of the Holevo 
quantity x* ■ 
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1 Background and statement of results 



1.1 Introduction 

This paper computes the capacity of the d-dimensional quantum depolarizing 
channel for transmission of classical information. The result confirms a long- 
standing conjecture, namely that the best rate of information transfer can be 
achieved without any entanglement across multiple uses of the channel. It is 
sufficient to choose an orthonormal basis for the state space, and then use this 
basis both to encode messages as product states at the input side and to perform 
measurements at the output side which project onto this same basis. In this 
sense the depolarizing channel can be treated as a classical channel. 

The Holevo-Schumacher- Westmoreland theorem allows the capacity to be 
expressed in terms of the Holevo quantity x*- If the Holevo quantity is additive, 
then this expression implies that the capacity is actually equal to x* ■ this 
paper we prove several additivity properties for the depolarizing channel, includ- 
ing the additivity of x*- One important mathematical tool used in the proof is 
the Lieb-Thirring inequality, which provides a bound for the non-commutative 
p-norm of a product of positive matrices. These notions, as well as the definition 
of channel capacity and its relation to the Holevo quantity, are described in the 
following subsections. 



1.2 The depolarizing channel 

The depolarizing channel is a particularly simple model for noise in quantum 
systems 0], and has been studied in a variety of contexts 0, ^, ||. In (i 
dimensions the model is implemented by a completely positive trace-preserving 
map Aa, depending on one real parameter A, which maps a state p on into 
a linear combination of itself and the d identity matrix /: 

AA(p) = Ap + i^/ (1) 

The condition of complete positivity requires that A satisfy the bounds 

^ < A < 1 (2) 



d^-l 



The channel Aa maps a pure input state to a mixed output state. Because 
the channel is highly symmetric, all such output states are unitarily equivalent. 
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and have eigenvalues X + {1 — X)/d (with multiphcity 1) and {1 — X)/d (with 
multiphcity d — 1). 



1.3 Measures of noisiness 

We will use three measures of noisiness for quantum channels, namely the mini- 
mal output entropy S^[n, the maximal output p-norm z/p, and the Holevo quan- 
tity X*- First recall that a state p is a positive operator with trace equal to 1, 
and its von Neumann entropy is defined by 

^(p) = -Trplogp (3) 

The minimal output entropy of the channel \1/ is defined as 

^^i,(vl>) = inf^(vl/(p)) (4) 

p 

Recall also that the p-norm of a positive matrix A is defined for p > 1 by 

\\A\\p= (TtAP) (5) 

Amosov, Holevo and Werner introduced the notion of the maximal p-norm 
of a channel as a way to characterize its noisiness. This quantity is defined as 

z/p(vl/)=sup||v|/(p)||p (6) 
p 

Since the entropy of a state is the negative of the derivative of the p-norm at 
p = 1, it follows that for any channel 

±.A^l__^ = -S^U^) (7) 

The third measure of noisiness, the Holevo quantity, is closely related to the 
information- carrying capacity of the channel, as will be explained in the next 
section. We will use the symbol S to denote an ensemble of input states for the 
channel, that is a collection of states pi together with a probability distribution 
TTj. The Holevo quantity is 

X*W = sup [^(^(p)) - ^7r,^(vl/(p,))] (8) 
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where p = Y1 '^iPi the average input state of the ensemble. 

These three measures can be easily computed for the depolarizing channel. 
The values are 



5'min(A> 



A + ^-7^) log f A + - 



d 



d 



A 



d 



(9) 



A + 



1 - A\p 



+ {d-l) 



i/p 



(10) 



X*(AA)=logrf-5^i,(A, 



The value ( pJ]) is achieved by choosing an ensemble £ consisting of pure states 
belonging to an orthonormal basis (it does'nt matter which one), and choosing 
the uniform distribution Hi = 1/d. Since the average input state for this ensem- 
ble is (l/d) I, the terms on the right side of (^ are separately maximized for 
this choice of ensemble, and this leads to the result (|Tl|). 



1.4 Additivity conjectures 

It is conjectured that Smm and x* are both additive for product channels. This 
would mean that for any channels \E'i and \E'2; 



5min(^l ® ^2) = 5min(*l) + 5„,in(^2) 



(12) 



and 



X*(^l®^2) =X*(*l)+X*(^2) 



(13) 



Equivalently, the conjectures would imply that for product channels both of 
these measures of noisiness are achieved with product input states. As we 
explain in the next section, the Holevo quantity x* is related to channel capacity, 
and in fact ([T3|) would imply that the channel capacity of an arbitrary channel 
\E' is precisely x*(^)- These conjectures have been established for some special 
classes of channels, including all unital qubit channels and all entanglement- 
breaking channels |T^. However a general proof has remained elusive. 



It was further conjectured in that the quantity Up should be multiplicative 
for product channels for all p > 1, reflecting the idea that the p-norm of a 
product channel would be maximized with product states. This conjecture 
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would imply {IT), since the entropy can be obtained from the derivative of the 
p-norm at p = 1. Again this conjecture has been established for some special 
classes of channels, in particular for unital qubit channels Q. However it is now 
known that the conjecture does not hold in general - this was demonstrated 



recently by the discovery of a family of counterexamples for values p > 5 ||16 
Nevertheless, in this paper we will prove that the AHW conjecture is true for 
any product channel of the form where is arbitrary, and we will show 
how this property implies the additivity of S'min and of x* for such a product 
channel. 



1.5 Channel capacity 

In order to relate these additivity results to the channel capacity problem, we 
recall first the definition of the capacity for a general channel \E'. Again we denote 
by S an ensemble of input states, and we also denote by a measurement, or 
POVM, at the output side of the channel. Recall that a POVM is a collection 
of positive operators {Ej} which sum to the identity matrix. When a state p 
is measured using a POVM {Ej}, the outcome j is obtained with probability 
TT{pEj). This notion generalizes the familiar von Neumann measurement, which 
is the special case where the operators Ej are orthogonal projections. 

The ensemble S, the POVM Ai and the channel \Ef together define a classical 
noisy channel, whose transition matrix is 

p,, = Tr[*(p,)E,] (14) 

If we write X for a random input signal with distribution tTj = P{X = i), then 
the output signal Y from this classical channel has distribution 

p(r = j) = J2^,p,^ (15) 

i 

The Shannon capacity of a classical noisy channel measures the maximum rate at 
which information can be reliably transmitted through the channel. Shannon's 
formula computes this capacity as the maximum of the mutual information 
I{X, Y) between an input signal X and its corresponding output signal Y given 
by (p!5D , where the maximum is evaluated over all choices of distribution {tTj} for 
X. For the case of a quantum channel, we are interested in the maximum rate 
that can be achieved using the optimal choices of input states {pi} and of output 
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measurements {Ej}. Therefore we are led to define the Shannon capacity of the 
quantum channel \1/ as 

Cshan(^) = SUp J(X,y) (16) 

where the distribution of the input signal X is determined by the ensemble S, 
that is P{X = i) = vTj, and where the output signal Y is determined by 
(p!5|). This can be easily evaluated for the depolarizing channel. The mutual 
imformation I{X, Y) in (|TBp is maximized by choosing an ensemble consisting 
of projections onto an orthonormal basis, and using the same basis for the 
measurement. The result is 

C^Shan(AA)=X*(AA) (17) 

where the capacity x*{'^\) is given by (|ll|). 

If two copies of the channel \1/ are available then it may be possible to achieve 
a higher rate of transmission by sharing the signals across the two channels. This 
possibility exists because quantum channels have an additional resource which 
is not available for classical channels, namely entangled states which can be 
used to encode signals for the product channel. It is also possible to make 
measurements at the output side using a POVM which projects onto entangled 
states. With these resources the best rate that can be achieved using two copies 
of the channel is 

^C^Shan(^ ® ^) (18) 

It is known that in general (|18]) is larger than (|16|) [H, ^ . This observation leads 
to the question of finding the asympototic capacity which would be achieved 
by sharing the input signals across an unlimited number of copies of This 
ultimate capacity is given by 

C„it(vl>) = hm iCshan(^^") (19) 

n^oo n 

(a standard subadditivity argument shows the existence of this limit). 

At the present time it is an open problem to determine Cuit(^) for an arbi- 
trary channel However it can be expressed in terms of the Holevo quantity 
(H). Recall that the Holevo bound implies that 

Cshan(^) < X*m, (20) 
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and hence that 

C^i.i^) < hm (21) 

Furthermore the Holevo-Schumacher- Westmoreland theorem shows that 

the rate can be achieved with multiple copies of the channel, by restricting 

to product states for the input signals, but allowing entangled measurements at 
the outputs. Applying this theorem to the product channel "^^"^ implies that 
the rate (1/n) x*(\&®") is achieved with input signals which may be entangled 
across n uses of the channel. Allowing n to be arbitrarily large leads to the 
result 

C,it(^) = lim -x*(^^") (22) 

n^oo n 

1.6 Statement of results 

Our first result is the evaluation of (|19D for the depolarizing channel. 

Theorem 1 The capacity of the d- dimensional depolarizing channel is 

C,it(AA) = x*(Aa) = Cshan(AA) = log ci - 5^in(A,) (23) 
where Smmi^x) is evaluated in 

The fact that Cuit(AA) = Cshan(AA) means that as far as the information- 
carrying properties of the depolarizing channel are concerned, there is no ad- 
vantage gained by using either entangled input states or using entangled mea- 
surements. The optimal rate can be achieved by choosing an orthonormal basis 
(because A^ is symmetric it does'nt matter which one) and using this basis to 
encode the signals and also to measure them. In this sense the channel behaves 
like a classical channel, and entanglement does not play any role in its capacity. 

The basic ingredient in the proof of Theorem |l| is the additivity of the Holevo 
quantity x* for the depolarizing channel, which we state in the next Theorem. 

Theorem 2 For any channel , 

X*(Aa®v&)=X*(Aa) + X*(^) (24) 
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Theorem ^ follows easily from this, as we now demonstrate. The result 
C^iti^x) = X*{^x) ill Theorem |l] follows immediately from Theorem ^ by 
choosing = A®" in (|2^ ) and applying (^). The second equality x*{^\) = 
Cshan(AA) was derived in ([T^). 

Finally we state the AHW conjecture for the depolarizing channel, which 
underlies all the other results. Since the derivative of t'p(^I/) at p = 1 is equal to 
— Sininl^), the additivity of Smm is a special case of the AHW conjecture. We 
state both results next in Theorem ^. 

Theorem 3 For any channel and any p > 1, 

Up{Ax ® ^) = Up{Ax) (25) 

and hence 

S^U^x ® ^) = S^U^x) + S^inW (26) 

Special cases of Theorem ^ were previously established, namely for integer 
values of p in all dimensions d , and for all p > 1 in dimension d = 2 . 

1.7 Organization 

The paper is organised as follows. Section 2 outlines the proof of Theorem ^, 
and states two key results which are used, namely the convex decomposition of 
the depolarizing channel, and the bound for the phase-damping channel. These 
results are then established in Sections 3 and 4, and finally Theorem ^ is proved 
in Section 5. Section 3 also describes in detail the convex decompositions for the 
two-dimensional qubit depolarizing channel. Section 6 contains some discussion 
of the nature of the proof, and some ideas about further directions to pursue. 



2 Outline of the proof 

2.1 Definition of phase-damping channel 

As discussed above. Theorem ^ follows immediately from Theorem using the 
HSW Theorem (p^ . Theorem |^ itself is a slight extension of Theorem ^ and 
will be proved in Section 5. Most of the work in this paper goes into the proof 
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of the AHW conjecture in Theorem ^ The proof presented here develops 
further the methods introduced in where the same result was established for 
unital qubit channels. The basic idea is similar: we express the depolarizing 
channel Aa as a convex combination of simpler channels, and then we prove a 
bound for these simpler channels which implies the result ([25|). In the simpler 
channels were unitarily equivalent to phase-damping channels, and we use the 
same name for the channels here, which are defined as follows. 

Definition 4 Let B = be an orthonormal basis, and let Ei = {ipil . 

The phase-damping channel corresponding to B is the one-parameter family of 
maps 

d 

<l>A(p) = Ap + (l-A)^E,pE, (27) 

i=l 

where the parameter X satisfies the bounds 

--^<A<1 (28) 

The parameter range (^) is required by the condition of complete positivity. 
If we write p = {pij) as a matrix in the basis then the channel (^) acts by 
scaling the off-diagonal entries and leaving unchanged the diagonal entries, that 
is 

We will express Aa as a convex combination of phase-damping channels, all 
with the same parameter A. Furthermore these phase-damping channels will all 
share a common property, which is expressed by the following definition. 

Definition 5 We say that a vector v = (f i, . . . , f^) in is uniform if \vi\ = 
\vj\ for all i,j = l,...,d. 

It will turn out that the phase-damping channels which arise in the convex 
decomposition are constructed from orthonormal bases B = {{ipi)} where all the 
vectors {ipi) are uniform. Since each vector lipi) is normalized, it follows that 
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all its entries have absolute value 1 / ^/d. As a consequence, if D is any diagonal 
matrix, then for any uniform state 



Tr W){i}\D = ^^\D\i)) = -TyD 



(30) 



Definition 6 Let $ he the phase-damping channel corresponding to the or- 
thonormal basis B. We say that $ is uniform if is uniform for every 

\A) e B. 



2.2 Three lemmas 

There are three steps in the proof of Theorem ^ The goal is to find a bound 
for ||(Aa ® ^)(ti2)|Ip which will lead to (P^D, where \1/ is any other channel, and 
Ti2 is any state. The first step is a partial diagonalization of the state ri2. This 
step uses the following invariance property of the depolarizing channel. 

Lemma 7 Let ti = Tr2(ri2) denote the reduced density matrix of T12, and let 
U be a unitary matrix. Define = (f/ ® I)ti2{U* ® /)• Then for all p > 1 



Proof: the definition of in ([l|) implies that the unitary matrix U ® I can 
be pulled through the channel ® and then the invariance of the p-norm 
implies (|l|). QED 

The second step uses the following result which expresses A^ as a convex 
combination of phase-damping channels. This result will be derived in Section 



Lemma 8 For n = 1, . . . ,2d'^{d + 1), there are positive numbers Cn, unitary 
matrices Un and uniform phase-damping channels such that for any state 
P 



||(A,®M/)(n2)||p=||(A,®M/)(r{2)||p 



(31) 



3. 



2d2((i+l) 




(32) 



n=l 
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The third step in the proof uses the following bound for the phase-damping 
channels. This bound will be derived in Section 4. 



Lemma 9 Let $a be a phase-damping channel defined as in 1^^, with corre- 
sponding orthogonal projectors Ei = \ipi){'ipi\. For an arbitrary bi-partite state 
Pi2 define 



pf = Tn {Ei ® /)pi2 



(33) 



where Tii is the trace over the first factor. Recall the factor i'p{Ax) from (f/^j. 
Then for all p > 1, 



i/p 



(34) 



1=1 



2.3 Proof of Theorem |^ 

We will now prove Theorem ^ using these three lemmas. Since the left side 
of ( [25| ) is at least as big as the right side, it is sufficient to prove that for any 
bipartite state T12 

||(Aa ® ^)(ri2)||p < z^p(Aa) z/p(^) (35) 

The first step is to use Lemma 1 to partially diagonalize the state tu- Let U 
be a unitary matrix which diagonalizes the reduced density matrix ri = Tr2(ri2), 
and let = {U ® I)ti2{U* ® /), so that t[ = UtiU* is diagonal. By Lemma 
1^ we can replace T12 by without changing the left side of (|35|). Therefore we 
will assume henceforth without loss of generality that ti is diagonal. 

The second step is to apply the convex decomposition (|32D on the left side 
of (H): 

2(i2(d+i) 

(AA®^)(ri2)= c„(f/:®/)(<fA^")®^)(ri2)(f/n®/) (36) 

n=l 



For the third step, notice that by convexity of the p-norm it is sufficient to 
prove the bound (^) for each term ($a*'"'* ® ^)(ti2) appearing on the right side 
of (|36D , namely 

mt''®^){ru)\\p<M^x)M^) (37) 
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In order to derive (^), we apply (Q) with 

pi2 = {I® ^)(ri2), p^2^ = ^(rf ) = ^(Tn [{E, ® 1)^2 
Therefore (0) gives 

i=l 

Now the definition of the p-norm z^p(^I/) imphes that for each i, 



1/p 



(3^ 



(39) 



1/p 



From the definition of Tn ^ it follows that 



Tr(r«) = TT{E,n] 



(40) 



(41) 



Furthermore in the first step we chose the state ri to be diagonal, and from 
Lemma | the phase-damping channels appearing on the right side of (^6|) are 
all uniform (recall Definition |^) . Hence from ( PD| ) we get 



Tr(E,ri) = ^Trn = ^ 
a a 

Inserting into (|3^) gives 

||($f ®^)(ri2)||p < S'-'M u,{A,) 



1\P- 



1/p 



(42) 



(43) 
(44) 



which completes the proof. QED 



3 The convex decomposition 
3.1 Proof of Lemma |9| 

The derivation proceeds in two stages. First we define a new channel fix which 
appears in an intermediate role: 



Qx{p) = Ax{p) + 



1 - A 
d 

12 



P - diag(p) 



(45) 



where diag(p) is the diagonal part of the matrix p. It is not hard to see that 
VLx is completely positive and trace-preserving for all A in the range (pH]). For 
example it can be re-written as 



1-A\ (c/-l)(l-A) 



d 



d 



I - diag(p) 



(46) 



and the map / — diag(p) /{d — 1) is easily seen to be completely positive and 
trace-preserving. Next let G be the diagonal unitary matrix with entries 



Gkk 

Gki 



exp 
0, 



d 

k ^ I 



l< k< d 



(47) 
(48) 



Lemma 10 For any matrix p, 
Xd 



Aa(p) 



1 + {d-l)X 



1-A 



l + {d-l)X d 



iX^(G*)^fi,(p)G^ 



(49) 



k=i 



Proof: A straightforward computation shows that for any matrix p 



^X;(G*)'=pG^ = diag(p) 



(50) 



k=l 



Applying the definitions of and Qx from (^ and (^), the result now follows 
easily. QED 

For the second stage in the derivation of ( |32D we express the channel Q\ 
itself as a convex combination of phase-damping channels. To this end define 
the diagonal unitary matrix H by 



Hkk 
Hki 



exp 
0, 



2'Kik'' 
k ^ I 



1< k<d 



(51) 
(52) 



and define the following pure state 16*): 



1^) 



Vd 



1 



(53) 
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For each k = 1, . . . ,d and a = 1, . . . , 2(f' we define the pure state 

\^Pk,a) = G'Hy) (54) 
and the corresponding orthogonal projection 

Ek,a = \i^k,a){i^k,a\ (55) 

For each fixed a, the states {\ilJk,a)} form an orthonormal basis. We denote by 
the corresponding family of phase-damping channels, that is 

d 

(p) = Ap + (1 - A) ^ Ek,apEk,a, a = 1, . . . , 2rf2 (56) 

k=l 



Lemma 11 



^A = ;;^E^?^ (57) 



a=l 



Proof: Using the definitions of fi;^, A;^ and ^^x^ ■, it suffices to show that for any 
state p 

^ 2d^ d 

^Y.Y1 ^k,apEk,a = I + P- diag(p) (58) 

a=l k=l 

For each x = 1, . . . , let |x) be the unit vector with entry 1 in position x, and 
elsewhere. Then it suffices to show that for all x, y 

^ 2d^ d . ^ -f = 

n — 1 i> — 1 ^ 



a=l k=l 

\th 



The (a, k) term on the left side of ( p9D can be written as 

{x\Ek,apEk,a\y) = {x\iJk,a){'ipk,a\p\'ipk,a){i^k,a\y) (60) 

d 

= ^ {x\tpk,a){^k,a\u){u\p\v){v\ipk,a){^k,a\y) 



u,v=l 
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Furthermore 



{x\^k,a) = {x\G'Hy) = —exp 



2'Kikx 
d 



exp 



Substituting into (|60| ) gives 

{x\Ek^apEk,a\y) 

1 ^ , I I \ r27ri/c , , 
^ 2^ (mIpI?^) exp — ^(x + t; - ?/ - u) 



(61) 



(62) 



u,v=l 



exp 



2nia 



2,2 2 2\ 



When the right side of ( p^D is substituted in (^), the sum over k gives zero 
unless x + f — y — uisan integer multiple of (i. Since x, v, y, u vary between 1 
and d, the only possible values are 



x + f — y — M = 0, (i, — c? 



(63) 



Similarly, the sum over a gives zero unless + f ^ — — -u^ is an integer multiple 
of 2d'^. In this case the only possibility is 



+ t;^ - - = 
Consider first the case that ( |63D gives 

x + v — y — u = d 
Let 7 = y + u, then a; + = 7 + rf, and hence 

2 < 7 < 

Elementary bounds then lead to 



+ f ^ > 7^ > + 



which shows that there can be no simultaneous solution of (|6^ and (pS]). A 
similar argument holds for the case 



(64) 



(65) 



(66) 



(67) 



x + v — y~u = —d 
The remaining case in ( |63|) can be written as 

X — u = y — V 



(68) 



(69) 
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and also ( |5^ can be written as 



(x — u){x + u) = {y — v){y + v) (70) 

It follows that the only simultaneous solutions of the equations ( |5UD and ( [7DD 
are x = y,u = v and x = u,y = v. Hence if x ^ y the left side of (^) gives 
{x\p\y), while if x = y the sum gives = Trp = 1. QED 

Combining Lemma ^ and Lemma |lT] we arrive at the convex decomposition 
of Aa, which expresses the depolarizing channel in terms of the phase-damping 
channels $a"^: 

^ ' a=l 

+ 2^1:1: 

Furthermore each state \il^k,a) defined in ( ^4] ) is uniform, and hence the phase- 
damping channels defined in (^) are also uniform. This completes the 
proof of Lemma ^ 

3.2 d = 2: the qubit depolarizing channel 

It is useful to look in detail at the familiar case d = 2. A general state p can be 
written as a 2 x 2 hermitian matrix 

The depolarizing channel (|I|) acts by 

A.w^r^-;/-" J. (73) 



where we have defined 



A. ^ (74) 
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Also the 'intermediate' channel Q\ defined in 



acts by 



A+a + A_6 A+c 
A+c A_a + A+6 



(75) 



-1 
1 



-0-2 



The first diagonal unitary matrix G defined in (47) is just 

G 

So the first relation ( ^9]) becomes 
Aa 



(76) 



2A ^ 1 - A 1 



1 + A 2 



1 + A 

which can be easily verified using ([73|) and ([TSl). 

The second diagonal unitary matrix H defined in (^) is now 



(77) 



H 



exp 









-1 



(7^ 



There are eight phase-damping channels defined in (^q) . Four of these can be 
written in terms of the usual Pauli matrices: 



(p) = $r(p) = A+p + A_a,pa, 

<l>?)(p) = <l>f (p) = A+p + A_a,.pa. 
The others can be written in terms of the following Pauli-type matrices: 



,(6) 



(79) 



^0) 







exp 



exp 



T 



exp 







exp 



- 7Ti/A 





The relations are 



,(5) 



^2) 



$f(p) = $V^(p) = A+p + A_rpr 



^3) 
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The second convex decomposition ( p7D now reads 



1 ^ 
8 ^ 

a=l 



(a) 
A 



^4) 



There is a lot of redundancy in the final decomposition (0), which now has 
24 terms on the right side. In fact Qx can be written as a convex combination 
of just two uniform phase-damping channels, namely 



(2) 



(4) 



^5) 



and this allows to be written as a convex combination of just four phase- 
damping channels. There may be a similar redundancy in (ITT]) for d > 2. 



4 The phase-damping channel 

In this section we will establish Lemma ^ for the product channel $a ® I- 
Without loss of generality we will choose the basis B = {\i)}, so that $a acts 
on a state by simply scaling all off-diagonal entries by the same factor A, as in 

d 

^,{p) = Xp+{l-X)J2\^){t\{t\p\t) (86) 

i=l 

The product channel $a ® / acts on bipartite states pu defined on C'' ® C^' 
for some dimension d'. It will be convenient to view these states as d x d block 
matrices, where each block is itself a d' x d' matrix. Furthermore there is a 
convenient factorization of these blocks, which can be derived by the following 
observation. Let us write ^/pl2 = {Vi . . . Vd) where each is a dd' x d' matrix. 
Then we have 



Pi2 = [Vpu] y/Pl2 (87) 





/Pl2 


V,*Vi 


... V^Vd 




••• VlVd 
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Recall the definition of in (^). With our choice of basis here, the matrix 
Ei®I is simply the orthogonal projector onto the i^^ block on the main diagonal, 
hence 

Pf = V* V, (88) 



The key to deriving the bound (|^) is to rewrite the factorization 
follows: 



Pl2 



/V{ 

Vi 



\ 





M 



V ... v,*J 

where M is the d x d block matrix 

r ... 



(Vx 

^2 

Vo 







as 



^9) 



M 



V 



and /' is the dd^ x dd^ identity matrix. Recall the state |^) defined in 
this we can rewrite M as the product state 



M = d 



(90) 
Using 

(91) 



Furthermore the simple action of the phase-damping channel $a implies that it 
acts on (PUf) in the following way: 



(<l>A®/)(p 
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(V* 
V* 



\ 







($,®/)(M) 



/V, 

V2 

Vo 



(92) 







vj 



Let us define 



A 





V2V2* 
\ 



\ 





(93) 
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and 



5 = ($, ® /)(M) = d (^^xm{0\)) ® r (94) 
Then ($a ® I){pi2) has the same spectrum as the matrix A^/'^BA^^'^ . Therefore 

Tr((<l>, ® I){pi2)f = Tx[A^'^BA^/^y (95) 
Now we use the Lieb-Thirring inequahty [jTU| , which states that for all p > 1 



Tif^A^'^BA^'^y < Tt(^Ap/^BPAp/^^ = Ti (^A^B^^ 
The matrix A^ is block diagonal: 



(96) 



/{ViV{y 



AP 



iV2V2*y 



V 



\ 





(97) 



Furthermore 



BP = dpUxm{e\)) ®i' 



{9t 



Explicit calculation shows that the diagonal entries of y^x{\(^){0\)j are all 

equal to (1 - A)^ + (rfA + 1 - A)^ - (1 - A)^ /d. Comparing this with and 
substituting (^TD and (|98|) into the right side of (|96[) we get 

Tri^APB^^ = S^-'^ [M^x)) J^^riV^^n' (99) 



i=l 



Now recall (P3), and also notice that for all i = 1, . . . ,d 



(100) 



Combining (H), (H) and (|10g) gives the bound (|§. QED 
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5 The additivity of %* 



The proof of Theorem ^ uses the representation of x* ^ min-max of relative 
entropy, combined with an entropy bound derived from Lemma The relative 
entropy representation was derived by Ohya, Petz and Watanabe [|12| and Schu- 
macher and Westmoreland JlSl. Recall that the relative entropy of two states p 
and uj is defined as 

S{p,uj)=TTp{\ogp-\ogu;) (101) 
The OPWSW representation for the Holevo capacity of the channel \1/ is 

X*(*) = infsup^(vl/(p),M/(^)) (102) 

= snpSh{p),^{uj*)) (103) 

where the state u* that achieves the infimum in ( |102| ) is the optimal average 
input state from the channel. For the depolarizing channel this optimal average 
is (l/d) I, that is the totally mixed state. For the product channel Aa ® the 
Holevo quantity x*(^a ® ^I/) can be upper bounded by choosing (l/d) I ^u* as 
the average input state. This leads to the following inequalities: 



X*(Aa) + x*W < X*(Aa ® ^) < sup 5 (Aa ® ^)(ri2), (l/d) I ® ^(u;*) (104) 

T12 ^ ' 

In order to prove additivity we will combine this with the following result. 
Lemma 12 For all bipartite states T12, 

^((AA®^)(ri2),(l/rf)/®^(u;*)) <x*(Aa) + X*(vI/) (105) 

Proof: 

The left side of (|105|) can be rewritten as 

5((AA®^)(ri2),(lM/®^(cu*)) = -S((AA®^)(ri2)) (106) 

+ logrf-Tr^(r2)log^(c<;*) 

where T2 is the reduced density matrix of T12. From here on we follow the steps 
in the proof of Theorem ^ First, by Lemma ^ we can assume without loss of 
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generality that ti = Tr2(Ti2) is diagonal. Second, notice that the channel 
appears on the right side of ( |106|) only in the first term. Therefore Lemma |^ 
and concavity of the entropy imply that it is sufficient to establish the bound 



5 (<I>A ® ^)(ri2), / ® ^(^*) < X*(A,) + x*m 



(107) 

where $a is a uniform phase-damping channel and where ti is diagonal. 

Next we apply ( ^41) with pi2 = (J® \E')(ri2), and take the derivative at p = 1 
to get 



5f(AA®^)(ri2)) > S^U^x)-\ogd 



(108) 



d d 2 

Xi log + XiS {^! {^—rf 
i=i i=i 



where x,- = Tr ( 



-('■) 



'2 ) 



and as usual 



Tri 



(E, ® /)ri2 



Since ri is diagonal and <I>a is uniform, it follows that 



_ 1 

Xi ~: 

a 



(109) 



(110) 



for alH = 1, . . . , (i, hence ^XjlogXj = — logci. Also recall the evaluation of 
X*(Aa) in (|n|). Hence ( p.08| ) can be written as 

1 

5((AA®vl/)(ri2)) >-x*(AA)+logd+-5^^(vl/(rfri'))) (111) 

j=i 

Furthermore, since the projections Ei in (|109|) constitute an orthonormal 
basis it follows that 



E 

i=l 



(/ ® /)Ti2 



7-2 



Therefore the left side of ( |107D can be rewritten as in (|10|) to get 



5(($A®^)(ri2),(l/rf)/®^(cu* 



(112) 



(113) 



1 

+ logrf - - ^Tr$(dri'^) log^(u;*) 
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Combining ( p.l3| ) with ( |1 1 1| ) we get 

5((A,®^)(pi2),(l/rf)/®^(^*)) < X*(Aa) (114) 



i=l 



Recall that TT{dT2^) = 1. Therefore it follows from (|102| ) that for each i 



l,...,d 



^(vl/(dr»),vI/(^*)) <x*W (115) 
and hence ([TTiD implies ([TOTl) . QED 

6 Conclusions and discussion 

We have presented the proof of a long-conjectured property of the d-dimensional 
depolarizing channel Aa, namely that its capacity for transmission of classical 
information can be achieved with product signal states and product measure- 
ments. This result follows as a consequence of several additivity results which 
we prove for the product channel Aa ® where is an arbitrary channel. The 
principal result is the proof of the AHW conjecture for the matrix p-norm, for 
all p > 1, from which we deduce the additivity of minimal entropy and of the 
Holevo quantity. The argument presented here is a generalization of the method 
used earlier by the author to prove similar results for all unital qubit channels, 
and involves re-writing the depolarizing channel as a convex combination of 
other simpler channels, which we refer to as phase-damping channels. 

If the additivity conjecture for the Holevo quantity is true for all channels, 
then there must be a general argument which can be used to provide a proof, 
and presumably this would give a different method of proof for Theorem |I|. 
However it is known that the AHW conjecture is not true in general [0, and 



indeed it is an interesting problem to determine the class of channels for which 
it does hold. As a consequence, it may be that the method of this paper gives 
the most direct route to the proof of the AHW property for the depolarizing 
channel. It is expected that the same method can be applied to prove the AHW 
result for a class of ci-dimensional channels, and this question is under study. 
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